Where to Look and How to Describe: Fashion Image Retrieval With an Attentional Heterogeneous Bilinear Network
نویسندگان
چکیده
Fashion products typically feature in compositions of a variety styles at different clothing parts. In order to distinguish images fashion products, we need extract both appearance (i.e., “how describe”) and localization “where look”) information, their interactions. To this end, propose biologically inspired framework for image-based product retrieval, which mimics the hypothesized two-stream visual processing system human brain. The proposed attentional heterogeneous bilinear network (AHBN) consists two branches: deep CNN branch fine-grained attributes fully convolutional landmark information. A joint channel-wise attention mechanism is further applied extracted features focus on important channels, followed by compact pooling layer model interaction streams. Our achieves satisfactory performance three retrieval benchmarks.
منابع مشابه
a comparison of teachers and supervisors, with respect to teacher efficacy and reflection
supervisors play an undeniable role in training teachers, before starting their professional experience by preparing them, at the initial years of their teaching by checking their work within the proper framework, and later on during their teaching by assessing their progress. but surprisingly, exploring their attributes, professional demands, and qualifications has remained a neglected theme i...
15 صفحه اولAn Inference Network Approach to Image Retrieval
Most image retrieval systems only allow a fragment of text or an example image as a query. Most users have more complex information needs that are not easily expressed in either of these forms. This paper proposes a model based on the Inference Network framework from information retrieval that employs a powerful query language that allows structured query operators, term weighting, and the comb...
متن کاملNIST: An Image Classification Network to Image Semantic Retrieval
This paper proposes a classification network to image semantic retrieval (NIST) framework to counter the image retrieval challenge. Our approach leverages the successful classification network GoogleNet based on Convolutional Neural Networks to obtain the semantic feature matrix which contains the serial number of classes and corresponding probabilities. Compared with traditional image retrieva...
متن کاملWhere to look next and what to look for
In [Sch 96c] we have introduced the use of Multidimensional Receptive Field Histograms for Probabilistic Object Recognition. In this paper we reverse the object recognition problem by asking the question, "where should we look?", when we want to verify the presence of an object, to track an object or to actively explore a scene. This paper describes the statistical framework from which we obtai...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2021
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2020.3034981